Fix TOML parse failure when number token hits buffer edge #356

yawkat · 2022-11-22T17:01:31Z

When a number token is exactly at the end of the lexer text buffer, the parser would advance the lexer, triggering a buffer refill, before the number is parsed from the buffer. This patch moves the advance operation to come after parsing, which resolves the issue.

Kudos to @wbprime for finding this and reporting it as micronaut-projects/micronaut-toml#93. Not even the fuzzer managed to hit this – maybe because it does not fail fast, and just returns the wrong output (NumberInput does no input checking)?
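For readers unfamiliar with the failure mode, here is a minimal sketch of the ordering problem described above. It is not the actual Jackson TOML lexer or parser: `BufferEdgeSketch`, `Lexer`, `readBuggy`, `readFixed` and `parseUnchecked` are hypothetical names, and the toy lexer assumes space-separated digit tokens that never span a buffer refill.

```java
import java.io.IOException;
import java.io.Reader;
import java.io.StringReader;

final class BufferEdgeSketch {
    /** Toy lexer (hypothetical): a token is a (start, length) view into a reusable buffer. */
    static final class Lexer {
        private final Reader in;
        final char[] buf;
        int start, length;       // view of the current token inside buf
        private int pos, limit;  // next unread index, number of valid chars

        Lexer(Reader in, int bufferSize) {
            this.in = in;
            this.buf = new char[bufferSize];
        }

        /** Moves to the next token. When the buffer is exhausted it is refilled, overwriting old text. */
        boolean advance() throws IOException {
            while (true) {
                if (pos == limit) {
                    int n = in.read(buf, 0, buf.length);
                    if (n <= 0) return false; // end of input
                    pos = 0;
                    limit = n;
                }
                if (buf[pos] != ' ') break;
                pos++; // skip separator
            }
            start = pos;
            while (pos < limit && buf[pos] != ' ') pos++;
            length = pos - start;
            return true;
        }
    }

    /** Unchecked conversion: trusts that every char in the range is a digit. */
    static int parseUnchecked(char[] buf, int start, int length) {
        int value = 0;
        for (int i = start; i < start + length; i++) {
            value = value * 10 + (buf[i] - '0');
        }
        return value;
    }

    /** Old order: advance first, then parse. The refill may already have replaced the token text. */
    static int readBuggy(Lexer lexer) throws IOException {
        int start = lexer.start, length = lexer.length;
        lexer.advance();
        return parseUnchecked(lexer.buf, start, length);
    }

    /** Patched order: parse the number from the buffer, then advance. */
    static int readFixed(Lexer lexer) throws IOException {
        int value = parseUnchecked(lexer.buf, lexer.start, lexer.length);
        lexer.advance();
        return value;
    }

    public static void main(String[] args) throws IOException {
        // "123" fills the 3-char buffer exactly, so the next advance triggers a refill.
        Lexer a = new Lexer(new StringReader("123 45"), 3);
        a.advance();
        System.out.println("parse, then advance: " + readFixed(a));

        Lexer b = new Lexer(new StringReader("123 45"), 3);
        b.advance();
        System.out.println("advance, then parse: " + readBuggy(b));
    }
}
```

In this sketch the buggy order throws nothing. It silently computes a number from whatever the refill left in the buffer, which is consistent with the suspicion that the fuzzer had no failure to observe.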

yawkat · 2022-11-22T17:13:38Z

hm outstanding test failures...

yawkat · 2022-11-22T17:17:05Z

i think the fuzz tests just fail because of illegal tokens now, but as long as there's a failure, this seems fine

cowtowncoder · 2022-11-22T17:46:26Z

@yawkat Correct, many/most methods in NumberInput expect the caller to pass valid Strings (since the original JSON parser validates input before calling them anyway). This is good for performance but can in fact hide errors like buffer-boundary ones.
The fuzzer also has the challenge that it can find a buffer-boundary bug if it results in an ArrayIndexOutOfBoundsException, but not if it just produces invalid output (like you suggested).
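To illustrate the point about detection, a small sketch follows. It does not use Jackson's `NumberInput`: `parseUnchecked`, `parseChecked` and `crashOracleFlags` are hypothetical stand-ins, and the "oracle" assumes a fuzz harness that flags a run only when an exception is thrown.

```java
import java.util.function.ToIntFunction;

final class SilentFailureSketch {
    /** Trusts the caller: no validation, so garbage in gives garbage out. */
    static int parseUnchecked(String s) {
        int value = 0;
        for (int i = 0; i < s.length(); i++) {
            value = value * 10 + (s.charAt(i) - '0');
        }
        return value;
    }

    /** Fail-fast variant: rejects anything that is not an ASCII digit. */
    static int parseChecked(String s) {
        for (int i = 0; i < s.length(); i++) {
            char c = s.charAt(i);
            if (c < '0' || c > '9') {
                throw new NumberFormatException("not a digit at index " + i);
            }
        }
        return parseUnchecked(s);
    }

    /** Crash-only oracle (assumption): a run counts as a finding only if it throws. */
    static boolean crashOracleFlags(ToIntFunction<String> parser, String input) {
        try {
            parser.applyAsInt(input);
            return false;
        } catch (RuntimeException e) {
            return true;
        }
    }

    public static void main(String[] args) {
        String corrupted = " 45"; // e.g. token text overwritten by a buffer refill
        System.out.println("unchecked flagged: " + crashOracleFlags(SilentFailureSketch::parseUnchecked, corrupted));
        System.out.println("checked flagged:   " + crashOracleFlags(SilentFailureSketch::parseChecked, corrupted));
    }
}
```

The trade-off is the one noted above: skipping validation keeps the hot path cheap, while the checked variant turns a wrong value into a failure that such a harness can observe.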

When a number token is exactly at the end of the lexer text buffer, the parser would advance the lexer, triggering a buffer refill, before the number is parsed from the buffer. This patch moves the advance operation to come after parsing, which resolves the issue.

Also tracked as FasterXML/jackson-dataformats-text#356
Fixes #93

Co-authored-by: Tim Yates

yawkat requested a review from cowtowncoder November 22, 2022 17:01

change fuzz test errors

d3965dd

yawkat mentioned this pull request Nov 22, 2022

Fix TOML parse failure when number token hits buffer edge micronaut-projects/micronaut-toml#95

Merged

cowtowncoder approved these changes Nov 22, 2022

cowtowncoder merged commit 0ba64a6 into 2.14 Nov 22, 2022

cowtowncoder deleted the chunk-edge branch November 22, 2022 17:48

cowtowncoder added a commit that referenced this pull request Nov 22, 2022

Update release notes wrt #356

9384493
